A graph convolutional topic model for short and noisy text streams
نویسندگان
چکیده
Learning hidden topics from data streams has become absolutely necessary but posed challenging problems such as concept drift well short and noisy data. Using prior knowledge to enrich a topic model is one of potential solutions cope with these challenges. Prior that derived human (e.g. Wordnet) or pre-trained Word2vec) very valuable useful help models work better. However, in streaming environment where arrives continually infinitely, existing studies are limited exploiting resources effectively. Especially, graph, contains meaningful word relations, ignored. In this paper, aim at graph effectively, we propose novel convolutional (GCTM) which integrates networks (GCN) into learning method learns the simultaneously for streams. each minibatch, our not only can exploit an external also balance old perform on new We conduct extensive experiments evaluate both (Wordnet) built embeddings (Word2vec). The experimental results show achieves significantly better performances than state-of-the-art baselines terms probabilistic predictive measure coherence. particular, when dealing texts drift.
منابع مشابه
Topic Flow Model: A Graph Theoretic Temporal Topic Model For Noisy Mediums
In the modern era, data is being created faster than ever. Social media, in particular, churns out hundreds of millions of short documents a day. It would be useful to understand the underlying topics being discussed on popular channels of social media, and how those discussions evolve over time. There exist state of the art topic models that accurately classify texts large and small, but few a...
متن کاملAn Event Graph Model for Discovering Trends from Text Streams
In this paper, we formally define and study the event graph model based on set theory and multi-relations theory, and discuss the methods of modeling event and event relations in detail. The event graph model is mainly designed to extract the potential events and the relationships between events from massive text streams, and further discover the trends embodied in the contents in text streams....
متن کاملA Convolutional Neural Network based on Adaptive Pooling for Classification of Noisy Images
Convolutional neural network is one of the effective methods for classifying images that performs learning using convolutional, pooling and fully-connected layers. All kinds of noise disrupt the operation of this network. Noise images reduce classification accuracy and increase convolutional neural network training time. Noise is an unwanted signal that destroys the original signal. Noise chang...
متن کاملWord Co-occurrence Augmented Topic Model in Short Text
Topic models learn topics base on the amount of the word co-occurrence in the documents. The word co-occurrence is a degree which describes how often the two words appear together. BTM, discovers topics from bi-terms in the whole corpus to overcome the lack of local word co-occurrence information. However, BTM will make the common words be performed excessively because BTM identifies the word c...
متن کاملShort Text Understanding by Leveraging Knowledge into Topic Model
In this paper, we investigate the challenging task of understanding short text (STU task) by jointly considering topic modeling and knowledge incorporation. Knowledge incorporation can solve the content sparsity problem effectively for topic modeling. Specifically, the phrase topic model is proposed to leverage the auto-mined knowledge, i.e., the phrases, to guide the generative process of shor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Neurocomputing
سال: 2022
ISSN: ['0925-2312', '1872-8286']
DOI: https://doi.org/10.1016/j.neucom.2021.10.047